Modelling Genetic Variations with Fragmentation-Coagulation Processes

نویسندگان

  • Yee Whye Teh
  • Charles Blundell
چکیده

We propose a novel class of Bayesian nonparametric models for sequential data called fragmentation-coagulation processes (FCPs). FCPs model a set of sequences using a partition-valued Markov process which evolves by splitting and merging clusters. An FCP is exchangeable, projective, stationary and reversible, and its equilibrium distributions are given by the Chinese restaurant process. As opposed to hidden Markov models, FCPs allow for flexible modelling of the number of clusters, and they avoid label switching non-identifiability problems. We develop an efficient Gibbs sampler for FCPs which uses uniformization and the forward-backward algorithm. Our development of FCPs is motivated by applications in population genetics, and we demonstrate the utility of FCPs on problems of genotype imputation with phased and unphased SNP data.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bayesian Nonparametric Modelling of Genetic Variations using Fragmentation-Coagulation Processes

We propose a novel class of Bayesian nonparametric models for variations in genetic data called fragmentation-coagulation processes (FCPs). FCPs model a set of sequences using a partition-valued Markov process which evolves by splitting and merging clusters. FCPs have a number of theoretically appealing properties: they are infinitely exchangeable, stationary and reversible, with equilibrium di...

متن کامل

Modelling Genetic Variations using Fragmentation-Coagulation Processes

We propose a novel class of Bayesian nonparametric models for sequential data called fragmentation-coagulation processes (FCPs). FCPs model a set of sequences using a partition-valued Markov process which evolves by splitting and merging clusters. An FCP is exchangeable, projective, stationary and reversible, and its equilibrium distributions are given by the Chinese restaurant process. As oppo...

متن کامل

An introduction to mathematical models of coagulation–fragmentation processes: A discrete deterministic mean-field approach

We summarise the properties and the fundamental mathematical results associated with basic models which describe coagulation and fragmentation processes in a deterministic manner and in which cluster size is a discrete quantity (an integer multiple of some basic unit size). In particular, we discuss Smoluchowski’s equation for aggregation, the Becker–Döring model of simultaneous aggregation and...

متن کامل

Dual random fragmentation and coagulation and an application to the genealogy of Yule processes

The purpose of this work is to describe a duality between a fragmentation associated to certain Dirichlet distributions and a natural random coagulation. The dual fragmentation and coalescent chains arising in this setting appear in the description of the genealogy of Yule processes.

متن کامل

Explosion Phenomena in Stochastic Coagulation – Fragmentation Models

First we establish explosion criteria for jump processes with an arbitrary locally compact separable metric state space. Then these results are applied to two stochastic coagulation–fragmentation models— the direct simulation model and the mass flow model. In the pure coagulation case, there is almost sure explosion in the mass flow model for arbitrary homogeneous coagulation kernels with expon...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011